Accurate estimation of homologue-specific DNA concentration-ratios in cancer samples allows long-range haplotyping
نویسندگان
چکیده
Interpretation of allelic copy measurements at polymorphic markers in cancer samples presents distinctive challenges and opportunities. Due to frequent gross chromosomal alterations occurring in cancer (aneuploidy), many genomic regions are present at homologus-allele imbalance. Within such regions, the unequal contribution of alleles at heterozygous markers allows for direct phasing of the haplotype derived from each individual parent. In addition, genome-wide estimates of homologue specific copyratios (HSCRs) are important for interpretation of the cancer genome in terms of fixed integral copy-numbers. We describe HAPSEG, a probabilistic method to interpret biallelic marker data in cancer samples. HAPSEG operates by partitioning the genome into segments of distinct copy number and modeling the four distinct genotypes in each segment. We describe general methods for fitting these models to data which are suitable for both SNP microarrays and massively parallel sequencing data. In addition, we demonstrate a specially tailored error-model for interpretation of systematic variations arising in microarray platforms. The ability to directly determine haplotypes from cancer samples represents an opportunity to expand reference panels of phased chromosomes, which may have general interest in various population genetic applications. In addition, this property may be exploited to interrogate the relationship between germline risk and cancer phenotype with greater sensitivity than is possible using unphased genotype. Finally, we exploit the statistical dependency of phased genotypes to enable the fitting of more elaborate sample-level error-model parameters, allowing more accurate estimation of HSCRs in cancer samples.
منابع مشابه
Using an electrochemical nanobiosensor based on titanium carbide-carbon nanotubes polymeric nanocomposite for the epithelialovarian cancer early detection
Background & Aim: Ovarian cancer is the most lethal among female malignancies. So far, treatment improvements have affected patient survival, but it is still more important to provide an early diagnosis that can detect the disease in its early stages. Therefore, introducing a rapid, accurate, and low-cost method to disease detection can be important and necessary. Methods: This study introduce...
متن کاملMolecular haplotyping of genetic markers 10 kb apart by allele-specific long-range PCR.
Haplotypes, combinations of polymorphic markers in a chromosome, are critical for genome diversity research. However, their utility in population samplings is compromised by uncertain linkage phase determinations from unrelated individuals. Molecular haplotyping accomplishes direct phase determination by generation of hemizygous templates from diploid genomic samples. We report molecular haplot...
متن کاملRapid, long-range molecular haplotyping of thiopurine S-methyltransferase (TPMT) *3A, *3B, and *3C.
BACKGROUND Haplotyping is an important technique in molecular diagnostics because haplotypes are often more predictive for individual phenotypes than are the underlying single-nucleotide polymorphisms (SNPs). Until recently, methods for haplotyping SNPs separated by kilobase distances were laborious and not applicable to high-throughput screening. In the case of thiopurine S-methyltransferase (...
متن کاملI-44: Concurrent Whole-Genome Haplotyping and Copy-Number Profiling of Single Cells
Background Methods for haplotyping and DNA copynumber typing of single cells are paramount for studying genomic heterogeneity and enabling genetic diagnosis. Before analyzing the DNA of a single cell by microarray or next-generation sequencing, a whole-genome amplification (WGA) process is required, but it substantially distorts the frequency and composition of the cell’s alleles. As a conseque...
متن کاملAn efficient haplotyping method with DNA pools.
Determination of haplotype frequencies (the joint distribution of genetic markers) in large population samples is a powerful tool for association studies. This is due to their greater extent of polymorphism since any two bi-allelic single nucleotide polymorphisms (SNPs) generate a potential four-allele genetic marker. Therefore, a haplotype may capture a given functional polymorphism with highe...
متن کامل